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Sunnary 

The  consistency  of  commonly  used  variance  estimates  for  studentizing 
robust  estimates  of  location  is  studied  when  the  underlying  distributions 
are  asymctric.  Surprisingly , the  popular  methods  for  studentizing  Id- 
estimates,  jackknifed  il- estimates  and  adaptive  trimed  roans  underestimate 
the  true  variance  by  a factor  approaching  50%  under  the  negative  exponential 
distribution.  Possible  reasons  for  the  failures  and  suggested  modifications 
are  ere seated. 
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Introduction 

This  project  started  with  the  nroblen  of  sequential  fixed -width 
interval  estimation  of  the  1 bait in?  functional  of  robust  estimators, 
be  first  compared  a nunber  of  estimators  in  many  symmetric  situations 
with  predictable  results  (concerning  robustness  of  efficiency)  which 
v/ill  not  be  reported  here  (see  Andrews ; et  al  (1972),  Carroll  and  begnan 
(1975)  ).  ’hen  the  sampling  distribution  was  the  negative  exponential, 
the  situation  was  surprisingly  degenerate;  the  modem  robust  estimators 
all  gave  much  lower  coverage  probabilities  than  advertised.  After  some 
study,  we  discovered  that  the  commonly  used  variance  estimates  under- 
estimated the  true  variance  by  nearly  501,  so  that  too  few  observations 
were  being  taken.  In  the  context  of  the  two-sample  location  problem, 
this  means  the  Type  I error  is  much  higher  than  desired.  Before  routine 
application  of  robust  two -sample  tests  can  be  recommended,  it  is  important 
to  discover  where  the  studentization  fails  and  to  suggest  appropriate 
modifications  to  existing  procedures. 

This  paner  describes  the  results  of  a Tdnte- Carlo  experiment,  comparing 
true  and  estimated  variances  in  asymmetric  situations,  be  show  that  in 
asymmetric  cases  such  as  the  negative  exponential  distribution,  hard 
adaption  (Hogg  (1974)  ),  (common)  H-estimators  and  jackknifed  M-estimtorS 
all  fail.  .Our  major  initial  success  was  obtained  with  -trimmed  means.  The 
reasons  for  the  occurrence  of  the  observed  phenomena  are  investigated,  afld 
recommendations  for  future  study  are  made. 
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The  Estimators 

Define  U(a)  (L(a))  as  tho  mac::  o'  the  largest  (smallest)  [a(n+l)] 
order  statistics.  As  a measure  of  tail  le^th,  Hogg  proposed 

Q = (U(.20)  - L(.20))/(U(.50)  - L(.50)) . 

Hampel 1 s seals  is  defined  by 

s^  = r.ecian{|X-  - sample  median | }/. 6754, 

with  the  property  tliat  sn  -*•  1 in  the  normal  case.  In  Table  1 we  list 
the  estimators  of  location  used  in  this  study.  The  variance  estimate 
for  the  sample  mean  M is  the  usual  sample  variance,  while  for  the  trimed 
means  5?  - 33?  it  is  the  jackknifed  variance  estimate  (Shorack  (1974)). 
For  the  one-step  iiarpels  and  Hubers  (D19-D20,  12A-25A,  HUB)  it  is  (Huber 
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For  the  adaptors  (HG1 , KG2 , 1.31D,  1.90D,  2.00A,  1.81D),  the  variance 
estimate  of  the  c’ncsen  location  estimate  is  used.  For  the  jackknifed 
ibber  (JHU3),  the  jackknifed  variance  estimate  is  used. 
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TABLE  1 

A description  of  the  location  estimates  used  in  the  study. 

Description 
Sample  near. 


a%  means  an  a'  syr.netrically  trimed 
nean.  38%  has  a = .375  . 


f 5%  Qsl.81 

\ 10%  if  1.31<Osl.87 
25%  0>1.87 


(.05+ (0-1. 30) (4/3)) 100% 

25% 


0*1.80 
if  1.30<Q*1.95 
0>1 . 95 


One- step  Huber  estimates,  with  start  being  the 
sample  median,  scale  s (Andrews,  et  al  (1972)  ), 

The  knot  in  the  ijj  function  for  DK  is  K/10. 

D20  Qsl.81 

D15  1,81<Q*1.S7 

D10  0>1.87 

D20  0<1. 90 

D15  if  1.90<0*2.05 
D10  Q>2.05 

Solution  (via  method  of  bisection  with  20 

iterations)  to  Xty(s^(X.^-t))  = 0,  inhere 

ii/(x)  = max(-1.5,  r:in(x,1.5)} . 

A j ac]  deni  fed  version  of  r?JB,  with  n = 50. 

Qne-steo  Hampel  estimates,  with  knots 
given  in  Andre1  rs , et  al  (1972).  Start  is 
the  sample  median,  scale  is  sn  . 

21A  if  Q<2 . 00 
12A.  if  Cfe2. 00 

25A  Qsl.81 

2 LA  if  1.81<0*1.87 
12A  Q>1.87 
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Rarvlon  : bribers 

29 

A congruent  ial  random  number  generator  (period  >2  ) with  a shuffling 

feature  was  used  to  obtain  the  uniform  random  variables,  while  standard 
normal  observations  were  obtained  by  the  Box -duller  algorithm.  The  shuffler 
first  generated  300  uniform.  deviates  and  then  chose  one  of  them  at 
random;  this  effectively  overcomes  the  undesirable  dependence  features 
of  the  usual  congruential  generator.  ITo  'bnte- Carlo  swindle  was  used, 
so  reported  results  will  be  accurate  to  about  one  decimal  place,  more 
then  enough  to  make  the  conclusions  given  here. 


Sarolinc  Distributions 

Ten  sampling  distributions  were  studied.  If  0 is  the  standard 
normal  distribution  function,  the  first  six  had  distributions  as  follows: 

N(0,1) 

M(0,l)  + ,05iT(l,l) 
H(C,1)+, 1011(1,1) 
N(0,1)+.05N(3,1) 
11(0,1)+. 10N(3.1) 

NE 

Letting  X be  a standard  normal  random,  variable,  the  last  four  were 
generated  by 


Type 

*(x) 

.9S*(x)+.05&(x-l) 

.90*(x)+.10$(x-l) 

.9S$(x)+.05q(x-3) 

. 904>(x)+.100(x-3) 

'egative  exponential  vdth  mean  1.25 


12. 5exp(. 10X) 
4.38exp(.25X) 
2.46exp(. 50X) 
.49exp(X) 


Exp(.lOX) 
Exp(.  25X) 
Exp ( . 50X) 
Bxp(X) 


Sanpl  '=>"  of  size  n “ 7S  wre  taken  for  each  iteration.  There  were 
500  iterations  on  N(0,1),  1000  iterations  on  ITE  and  Exp(X),  and 

250  iterations  on  the  others. 
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Oajor  Results 


This  paper  focuses  on  the  effects  of  asymmetry  on  the  consistency 
of  variance  estimates  of  robust  estimators.  If  n = 75  is  the  sarrple 
size,  ’I  is  the  number  of  experiments,  and  Yp  ...  ,Yjj  the  realized 
values  of  a particular  estimator,  the  (standardized)  Ibnte -Carlo  variance 
of  the  estimator  is 


2 

o 


n S Cfi  - V 


The  average  value  of  a variance  estimate  for  a particular  location  estimator 
? 2 2 

is  denoted  by  cT  . Tabic  2 presents  values  of  o^/o  for  all  ten  sampling 

2 2 

distributions.  Table  3 presents  values  of  o and  for  N(0,1),  NE, 
and  Exp  O'),  the  first  acting  as  a standard,  with  the  latter  two  representing 
worst  case  asymmetric  distributions. 
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Values  of  the  ratio  a‘7a 
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/.  consistent  variance  estimate  should  have 
tnis  vaiua  near  KGS. 
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.97 

.39 

.94 

1.11 

1.00 

1.01 

.96 

.97 

.93 

1.00 

5% 

.95 

.97 

.33 

1.14 

1.10 

.96 

.90 

1.02 

.97 

1.05 

10% 

.94 

.55 

.92 

1.15 

1.06 

.97 

.33 

1.04 

.97 

1.04 

25% 

.93 

.96 

.93 

1.10 

.99 

1.00 

.36 

1.05 

.94 

.99 

70  e. 

J)  i) 

.92 

1.04 

1.00 

1.13 

1.02 

1.02 

.33 

1.02 

.94 

.98 

H51 

.90 

.92 

.90 

1.06 

.30 

.53 

.84 

.97 

.70 

.62 

HG2 

.97 

.94 

.97 

.98 

.97 

.60 

1.08 

.37 

.79 

.62 

DIO 

.90 

1.03 

.94 

1.09 

.55 

.54 

.89 

.83 

.32 

.52 

D15 

.95 

1.02 

.55 

1.12 

.97 

.59 

.52 

.38 

.75 

. 53 

D20 

.97 

1.C2 

. 95 

1.13 

.98 

.68 

.95 

.51 

.73 

.59 

1.81P 

.91 

1.04 

. 95 

1.11 

.95 

.59 

.91 

.90 

.32 

.54 

1. 900 

.97 

1.02 

.95 

1.12 

.55 

.59 

.94 

.89 

.74 

.42 

I?JB 

1.07 

.99 

.84 

.93 

.93 

.60 

.03 

.81 

.67 

.52 

Ji7JB 

1.00 

-- 

-- 

-- 

-- 

.61 

-- 

-- 

-- 

.53 

12A 

.97 

.05 

. 52 

1.12 

.98 

.49 

.90 

.34 

.75 

.44 

2IA 

1.01 

.96 

.54 

1.12 

1.01 

.62 

.91 

.90 

.51 

.50 

25A 

1.01 

.97 

.93 

1.11 

1.03 

.70 

.93 

.93 

.36 

.55 

2.00A 

1.01 

.96 

.94 

1.11 

• 61 

.91 

.90 

.80 

.35 

1.31A 

1.00 

.94 

.92 

1.03 

.90 

.42 

.91 

.84 

.65 

.34 
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I.  (Trimed  Means) 

The  symmetrically  trimed  means  i1  (or  Dij , 51  - 38%  were  the 

only  estimators  from  Table  1 v:hosc  variance  estimates  consistently  estimated 
the  true  variance  over  the  v/hole  range  of  distributions,  'fence,  if  the 
user  has  serious  concern  about  asymmetry,  trimed  means  ;jould  be  recommended 
for  their  robustness  of  validity. 

ft 

The  first  Hogg  adaptor  (MCI)  uses  a discrete  adaptive  approach  which 

chooses  among  a finite  set  of  _x>ssible  trimed  means.  The  particular 

choice  used  here  fails  at  15,  Exc(.50X),  and  Exp(X).  This  is  most 

unusual,  especially  in  view  of  the  success  of  individual  trimmed,  means. 

2 

i»te  that  in  Table  3,  vfaile  the  value  for  o,,  is  in  line  with  individual 

n 

2 

trimed  means,  the  value  for  a is  considerably  larger  in  magnitude. 

The  follovdng  arguments  should  indicate  in  part  that  this  explosion  of 
2 

c can  be  expected  to  occur  over  tne  whole  range  of  discrete  adaptors, 
denote  0 = 0(n,F)  to  emphasize  the  dependence  on  the  sample  size  n 
and  the  underlying  (continuous)  distribution  Sanction  F.  Typically 
n5(Q(n,F)  - 0(F))  N(0,  c*(Q))f  where  0(F)  is  some  constant.  Denote 

the  a- trimed  mean  by  S(n,a)  with  limit  value  S(a).  Suppose  F is 
symmetric  about  9 and  tliat  one  of  the  ’riots'  or  change  points'1'  in 
the  adaptor  is  0(F)  (for  example,  in  HG1  these  Inots  are  1.81  and 
1.37).  Then  asymptotically,  the  adaptor  is 

T*(n,F)  = Sfn.Oj)  if  Q(n,F)  s Q(F) 

= S(n,a2)  if  0(n,F)  > 0(F). 


Since  ’(n.F)  is  ever.  arid  Sfn  yy)  is  odd,  ry(";(n  f)  - 0(F))  and 
rrfS(n.a)  - S(a))  are  asymptotically  independent , 0(a)  = 0 and 

Pr{ns(i*(n  F)-0)az}  h |?r{r-' ; (3(r.  .c^)  -0 )<z}  + (S(n,a2)  -0)  ozj  (3) 

hence.  n - o)  does  not  c >ite  have  ?.  liniti.i?  normal  distribution 4 
alt  .curb  the  variance  rill  net  have  blown  up.  hr /aver  if  F is  asymmetric 
S(c-,)  - S(a2)  sr  i if  the  spproxi  ation  (3)  holds ; \re  see  that  for  no  0 
does  . '(T*(r.,F)  - 6)  nave  a proper  limit  distribution  (in  fact  rass 
■ ill  be  nlsced  at  +-'>  or  -»  ).  Those  very  heuristic  arguments  shot- 
that  t ituderitiz?. ei.cn  of  hard  adaptors  is  likely  to  fail  whenever  one 

is  ’.  fortu'.cta  enough  to  have  chosen  Q(F)  as  a knot.  Since  Q(IE)  « 1.804 
me  o(Fv)(,50h})  3 1- F|2  (the  latter  obtained  by  ? bate -Carlo  sa'inlinr 
with  ?S°  .rials);,  one  sees  the  probable  explanation  for  the  failure 


r'i.  if  these  cases.  In  . Sonte- Carlo  sanplinp  ve  obtained 
' (f; ''r  '»)  - 1.36  intuitively,  in  this  heavily  asjrxietric  situation  the 
choices  10"  and  2Si  are  so  different  as  to  provide  a boost 
to  the  variance,  however ; an  adequate  explanation  is  still  needed. 

It  would  seer,  then  that  a source  of  difficulty  is  that  T*(n.F) 
is  not  continuous  in  0(n,F)  for  all  n.  T;G2  provides  an.  example  which 
shows  tnat  the  continuity  merely  of  lit  T*(n.F)  = T*(F)  is  not  sufficient. 

p 

As  a first  constraint  then,  any  adaptive  estimate  should  have  the  above 
continuity  orenerty.  Although  ve  experimented  with  linear  functions  of 
order  statistics  ’rith  weight  factions  depending  smoothly  on  a (such 
as  piecewise  linear  and  quadratic),  v/e  had  very  little  success.  The 
problem  of  constructing  smoothly  adaptive  estimates  ’rith  reasonably 
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consistent  variance  estimates  in  s:  nil  sou  pies  is  an  open  and  v/ortkwhile 
nroblen. 


II . (I  .-esti’.acors) 

The  Ilufcor  estimate  -Li)  (generate!  by  i^(x)  = rnx(k,;  nin(x/K2))  > 

-k-i  ~ 1<2  = l.S  ) has  :,oo i robustness  and  breakdotin  properties  but  Carroll 
(1575a)  lias  show.  that  if  F is  as'/n  etric . s,.  -+  5,  then  there  exist 

non- zero  constants  a-, . a-,  with. 

(Tn  - TCP))  = a^'1  - 7(F)) j + a2(s-X  - Q + o(rf!i)  (a.s.).  (4) 

This  reans  that  the  variance  estimate  (1)  (suggested  by  Tuber  (1570) 
and  used  by  Cross  (1571)  ) is  not  consistent  for  asyrr.etric  distributions. 
This  is  clearly  reflected  ir.  Tables  2 and  3.  The  s ti’ dent i z at i on  suggested 
by  (1)  would  hold  if  a2  = 0,  which  is  equivalent  to 


:{(x  - T(F))f  Jd_1(X  - T (F) ) ) } = o. 


(5) 


One  possibility  would  be  to  choose  v, , k-  to  force  (3)  to  ’-sold,  at 
least  asymptotically.  Cin.ce  .iuber  estimates  have  such  nice  properties, 
some  future  wor'..  should  be  devoted  to  finding  a satisfactory  solution. 

That  the  jac'daiifed  I iuber  estivate  failed  v;as  both  surprising  and 
disappointing.  Cone  insight  into  the  cause  of  this  phenomena  is  available. 
Jaeckel  in  a:;  unpjblisbed  paper  indicates  that  showing  southing  like 

- T(F)  - a^'1  g - !(F))]J  0 (5) 

is  the  rissing  step  which  needs  to  be  verified  for  his  arguments  to  hold, 


— — 
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(4)  shows  that  (6)  fails.  Further,  it  is  well-known  that  the  jackknife 
does  not  provide  a consistent  estimate  of  the  variance  of  the  sanple 
median;  since  the  scale  sn  is  very  close  to  a sanple  median,  it  would 
seen  to  be  the  cause  of  some  instability.  If  tins  latter  point  is  the 
root  difficulty  (a  point  which  needs  to  be  settled) , perhaps  using  a 
scale  with  somewhat  smoother  influence  function  is  mandated.  One  possi- 
bility would  be  to  take  the  average  of  the  smallest  half  of 
{ - median|,  ...  , |Xj  - median | }.  ’.'e  have  done  some  partial  work 

In  this  direction  with  little  success. 

III.  (One- step  estimates) 

The  one-step  fi- estimates  and  their  adaptors  were  included  for  illus- 
trative purposes  although  they  would  not  be  used  in  obviously  asymmetric 
situatioris.  Carroll  (1975b)  lias  shown  that  an  analogue  to  (4)  exists 
in  this  case  as  well,  so  the  failures  at  ME  and  Exp(X)  are  not  unexpected. 
One  interesting  point  to  note  is  the  very  poor  performance  of  2 LA.  and 
7.SA  at  the  negative  exponential,  we  have  no  good  explanations  for  this. 

Gross  (1975),  in  surveying  the  disappointing  performance  (in  syi.r.etric 
situations)  of  jackknifed  versions  of  these  estimates,  recomends  replacing 
the  median  by  a smooth  starting  value.  Hie  arguments  of  the  previous 
section  indicate  an  additional  need  for  a smooth  scale. 
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Conclusions 

In  this  paper  we  have  shown  that  wide  classes  of  robust  es tiuvot-.es 
will  not  perform  adequately  in  the  two- sarnie  location  problem  in  the 
presence  of  asymmetry.  Vfhile  trimmed  means  work  quite  well,  discrete 
adaptive  trimmed  means  have  been  shown  to  degenerate  whenever  one  of 
the  knots  is  near  the  limiting  value  of  the  tail  length  functional  Q(n,F) 
evidently  only  smoothly  adaptive  trimmed  means  ’nave  any  hope  of  success. 

1- estimates  and  their  one- step  versions  have  been  shown  analytically 
to  lead  to  inconsistent  variance  estimates  if  the  <Jj- function  is  skew- 
sy: -metric  this  suggests  the  possibility  of  allowing  the  ^-function  to 
also  depend  on  the  data.  It  would  appear  tliat  !i- estimates  and  their 
j ac’dTiifed  versions  share  the  same  source  of  difficulty,  but  this  point 
needs  to  be  clarified. 
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